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ABSTRACT 

We study the evolution of non- linear structure as a function of scale in samples from the 
2dF Galaxy Redshift Survey, constituting over 221 000 galaxies at a median redshift 
of z = 0.11. The two flux-limited galaxy samples, located near the southern galactic 
pole and the galactic equator, are smoothed with Gaussian filters of width ranging 
from 5 to 8 h~^Mpc to produce a continuous galaxy density field. The topological 
genus statistic is used to measure the relative abundance of overdense clusters to void 
regions at each scale; these results are compared to the predictions of analytic theory, 
in the form of the genus statistic for i) the linear regime case of a Gaussian random 
field; and ii) a first-order perturbative expansion of the weakly non-linear evolved field. 
The measurements demonstrate a statistically significant detection of an asymmetry 
in the genus statistic between regions corresponding to low- and high-density volumes 
of the universe. We attribute the asymmetry to the non-linear effects of gravitational 
evolution and biased galaxy formation, and demonstrate that these effects evolve as a 
function of scale. We find that neither analytic prescription satisfactorily reproduces 
the measurements, though the weakly non-linear theory yields substantially better 
results in some cases, and we discuss the potential explanations for this result. 

Key words: cosmology: observations — galaxies: statistics — large-scale structure 
of Universe 



1 INTRODUCTION 

The large-scale structure of the cosmological density field 
contains important information about the initial conditions 
and evolution of the Universe. This structure evolves solely 
under the influence of gravity and its properties are a strong 
function of scale. Specifically, on the largest scales the mor- 
phology of the structure will reflect its primordial shape, ex- 
pected to be that of nearly scale-invariant Gaussian random 
density perturbations ( [Bardeen et al.||l983[ |Peebles||1993[ 
Peacock 19991, generated by the scalar field, or fields, re- 



sponsible for cosmological inflation ( 


Guth|1981 


Albrecht & 


|Steinhardt|1982||Liddle & Lyth|2000 


|Linde|2005 


\. That this 



description accords with observation is strongly supported 
by galaocy redshift surveys ( GoUer & Huchra ]1989' Sh eet- 1 
|man et al.|p96, .CoUess et al.„2001[ [York et al.„2000J and 



microwave background measurements (Bennett et al. 1994 



Spergel et al. 20061, as well as the methods used to ob- 
tain information about morphology of the large-scale struc- 
ture ( |Gott et al.|1989||Melott et al.|1989l|Gott et al.|1992 | 



Indeed, the tools for measuring these properties — the 
topology of the large-scale structure — are beginning to find 
wider application. It is more than 20 years since the publica- 



tion of the seminal work of Gott et al 
has matured substantially. 



( 1986[ ), and the field 
whether 



The initial question 
the topology of the cosmological density field is consistent 
with inflation — has been answered in the affirmative through 



repeated analysis 


Gott et al. II 1989 


Vogeley et al. 


1994 


Canavezes et al.|1998| |Hikage et al. 


2002, ,Hoyle et al. 


2002 


Hikage et al.|2003 


Park et al.|2005 


James et al.|2007 


1 and 



studies of the departure from this as a result of non-linear 
gravitational evolution, rather than as an initial condition, 
are an exciting new direction for observational studies (c/. 
Melott et al.|1988| for pioneering work with simulations). 
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The classic topological statistic is the genus number of 
constant-density surfaces through the field, with the sur- 
faces ordered so as to excise increasingly larger volumes. 
Each genus number is a measure of the connectedness of 
the structure at a particular density; the resulting locus, 
called the genus curve, is genus number as a function of 
density relative to the median. The most important quali- 
tative feature of this curve, for a Gaussian random field, is 
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symmetry about this median density — the high- and low- 
density regions of the immediate post-inflation Universe are 
equivalent and interlocking, giving rise to a so-called sponge 
topology. With regard to the configuration of regions above 
and below median density, this is one of only three possibil- 
ities: it is the middle ground between the universe evoked 



by Press & Schechter (19741, a connected void populated 
with high-density clusters — a meatball topology — on the one 
hand, and the single web of high-density structure punc- 
tured by voids — a swiss-cheese topology — on the other. All 
three are idealisations: a spectrum runs smoothly from one 
extreme to the other, leading to the more common charac- 
terisation in terms of meatball- and swiss-cheese- s/ii/i away 
from the norm. 

The genus curve has proven to be an especially valu- 
able measure, as it can be predicted analytically for the case 



of the Gaussian random density field ( Doroshkevich 1970 



|Adler|198l||Bardeen et al.|1986|[Hamilton et al.|1986[ ). That 
the topology of structure will be scale-dependent should 
follow from the predictions of inflationary models together 
with observations of the local Universe, where isolated high- 
density structures predominate. This will produce a transi- 
tion in the genus curve from symmetry at (sufficiently) large 
scales to a strong asymmetry in the non-linear regime of 
structure growth on small-scale and at late times. Naively, 
such departures would be expected to arise at about the 
correlation length, where 5p/ p ~ 1 and modes of different 
sizes are beginning to interact. However, the form of the 
genus curve for a general field that does not obey Gaussian 
statistics has resisted analytical description, despite signifi- 
cant theoretical headway for specific cases of non-Gaussian 



fields (Hamilton 1988 Matsubara fc Yokoyama 1996 Hik- 



age et al. 20071 and second-order perturbation expansions 



of the Gaussian result (based on the methods of Kofman 



et al. 1994 Juszkiewicz et al. 1995 1 into the weakly non- 



Seto et al. 



Matsubara 



linear regime (Matsubara 1994 Matsubara & Suto 1996 



19971. The latter case is especially salient and 



( 2003 1 provides a key conclusion: that the ef- 



fect of the weakly non-linear evolution on the genus curve is 
relatively modest. 

It is against these expectations that the data must be 
tested. The weakly non-linear approximation will be ob- 
served to break down on sufficiently small scales and the 
behaviour of the genus curves throughout and beyond the 
regime in which it is valid provides higher-order informa- 
tion on the evolution of the density field than would be 
garnered with correlation functions or power spectra. Such 
a measurement requires a survey of a large volume with 
statistical properties that are well understood. The consid- 
erable sophistication of the statistical methods that have 
been brought to bear on galaxy redshift surveys since the 
turn of the century provide an opportunity to address this 
topic directly. 

In this paper, we use the final data release of the Two- 
degree Field Galaxy Redshift Survey (2dFGRS) to make 
measurements of the genus statistic on the cosmological den- 
sity field on scales ranging from 5 to 8 /i~^Mpc. This builds 
upon the analysis of statistical properties of the galaxy 
distribution with this data set, including the power spec- 



trum (Cole et al. 20051, void probability function (Croton 
|et al.| 2004b[), the two-po int and higher-order correlation 



the genus statistic measured in the linear regime as a test 
of the infiationary hypothesis ( James et al. [20(57 1. Section [2] 
describes how the density field is reconstructed from the 
galaxy survey and the restrictions this places on subsequent 
measurements. The genus measurement itself is the subject 
of Section [S] as are the physical meanings of the different 
qualitative results that may be obtained. Section [4] presents 
the results and method of error analysis for genus measure- 
ments of the 2dFGRS data, in which we test for the presence 
of a shift in the genus curve and discuss the implications 
of these results. Throughout, we use a cosmology in which 
fi™ = 0.27, f^A = 0.73 and Ho = lOO/i km sec"^ Mpc"\ 
with h = 0.72. 



2 SAMPLES FROM THE 2DFGRS 

At completion, the Two-degree Field Galaxy Redshift Sur- 



vey ( [Colless et al. 2001 1 measured spectroscopic redshifts 
for 221 000 galaxies in regions of the sky well separated to 
ensure statistical independence. With a limiting magnitude 



of 19.45 in the 6,j band (Blair & Gilmore 19821, a median 



redshift of 2 = 0.11 is obtained with close-to-uniform sam- 
pling across the sky. The survey comprises two regions lo- 
cated near the South Galactic Pole (SGP) and slightly to the 
north of the Galactic equator (NGP), along with a number 
of smaller random fields that will not be used for the analysis 
in this paper, giving a total survey area of over 2000 deg^. 
The larger SGP region is a strip of approximately 90° x 15° 
(2l''40'" < Q < 03''40"\ -37?5 <S < -22?5) in which the 
redshifts of 115 492 galaxies are avilable for use in a sta- 
tistically complete sample; the NGP strip is approximately 
75° X 10° (09''50'" < Q < 14'^50"", -7?5 < <5 < +2?5) and 
contains 79 696 galaxies. 



2.1 Selection effects 

The data reduction includes extinction and k + e correc- 
tions, as well as masking of the survey area to account for 
several distinct selection effects: i) variation in magnitude 
limit between the two-degree fields that tile the full sur- 
vey regions; ii) variation between fields in the fraction of 
objects with high-quality spectroscopic redshifts; and iii) 
variations due to magnitude-dependent redshift complete- 
ness. These corrections are performed with the publicly 
available software of Peder Norberg and Shaun Cole, docu- 



mented by Norberg et al. (20021. Information on the soft- 



functions (Hawkins et al. 2003 Croton et al. 2004a I and 



ware, and the code itself, is provided on the 2dFGRS website 
at http : //www2 . aao . gov . au/2dFGRS/. 

Correcting for these effects addresses statistical incom- 
pleteness across right ascension and declination. When gen- 
erating samples that maintain a constant number density 
with redshift, the effect of decrease in absolute magnitude 
limit with distance must be addressed. When the luminosity 
function of the population of observed galaxies is well under- 
stood, it is possible to estimate the fraction of galaxies — as 
a function of magnitude and redshift — that go unobserved 
as a result. The inverse of this number can be used to weight 
the galaxies that are seen, which serve as markers for the 
hidden population. 

The combination of all statistical completeness effects 
is estimated with a single selection function, giving a weight 
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to each galaxy. Because such a weighting scheme is useful 
to many different analyses of a single data set, the 2dFGRS 
selection function was computed by Norberg et al. ( 200^ ; 



they describe the estimators used to produce the weightings 
as well as other salient aspects of the &j-band luminosity 
function. Galaxy samples produced in this manner are flux- 
limited and retain all objects in the original catalogue at the 
expense of uncertainty over the exact values of the weighting 
and limitations imposed by the estimation. 

An alternative to using a weighted galaxy sample is to 
remove the redshift-dependent effect by drawing a sample 
comprising just those objects that would be visible at the 
edge of the sample region, given the magnitude-limit of the 
survey {e.g. Martinez & Saar 20021; such volume-limited 



samples are the more common way of making a measure- 
ment of the topological genus from a redshift survey(e.g., 



Gott et al. 


|1989 


Canavczes et al.||1998; Hoylc et al. 2002; 


Park et al.| 


2005 




Gott et al.||2008 


I; 


Vogeley et al. 


(1994 


1 is 



a notable counterexample. Flux-limited samples of the 2dF- 



GRS were used by James et al. (20071, who demonstrated 



that i) the choice of flux-limited catalogues reduced the noise 
in the measurement without introducing spurious effects; 
and ii) for the purposes of topological analysis, the selec- 
tion function of the 2dFGRS is best restricted to the region 
z < 0.2. 



NGP 


A 


^max 


V (/106) 


N 


1/n 


res 


P{%) 


5 


0.12 


6.652 


48892 


136.0 


0.20 


11.44 


6 


0.15 


12.98 


63431 


204.6 


0.25 


10.31 


7 


0.19 


25.74 


73897 


348.3 


0.29 


9.33 


8 


0.20 


29.86 


75268 


396.7 


0.25 


10.18 


SGP 


5 


0.10 


5.733 


42441 


135.1 


0.20 


10.94 


6 


0.15 


19.19 


82399 


232.9 


0.25 


7.33 


7 


0.17 


27.66 


92893 


297.7 


0.29 


7.45 


8 


0.20 


44.11 


103616 


425.7 


0.25 


7.68 



Table 1. Description of the point samples used in the genus mea- 
surement: for a given smoothing scale (A, in /i~^Mpc), a value for 
the redshift limit (zmax) of the sample is chosen given the con- 
straints of Figure[lj this defines the volume (V, in cubic comoving 
megaparsecs) , number of galaxies (TV) and number density (n) of 
the sample and the resolution of the array cells (in units of A). 
The sum of galaxy weights (the 'power', a proxy for mass) is not 
conserved by the smoothing process, which pushes power out- 
side the edges of the survey region. By normalising the smoothed 
galaxy distribution to a smoothed constant field, no systematic 
effects are introduced by this process, though the loss of power 
reduces the quality of the signal. 



2.2 Sample constraints 

The choice of the maximum redshift to which a galaxy sam- 
ple should extend depends on the properties of the underly- 
ing catalogue and the scale that is to be studied. This length 
scale is imposed through the smoothing process, where it 
is identified with the characteristic width of the filter. The 
most restrictive requirement is that the smoothing length. A, 
exceed the average nearest-neighbour separation of galaxies 
in the sample volume. The latter quantity derives from the 
mean number density of the survey, n; it is proportional to 
the cube root of the volume per object, n~^^^. For a Poisson 
point process, the mean nearest neighbour separation is 

1/3 

(1) 



^1/3 



V 



7r3/2iV 



where V is the volume of the survey and N is the num- 
ber of galaxies. Ifowever, the cosmological density field is 
not a continuous Poisson field, nor is the drawing of points 
from the background an unbiased process, so galaxies in red- 
shift surveys should be assumed not to be a Poisson process, 
though the qualitative form of the relationship between Inn 
and n should not change. A safe choice of smoothing length 
for a Gaussian filter is 

1/3 



> ^nn 



(2) 



this prescription has been used previously with success by 
[Vogeley et al] ( p94| ). The average number density is a func- 
tion of the size of the survey; for a survey through a fixed 
window in right ascension and declination, n is a function 
of 2:max ouly. lu this way, the scale to be probed infiuences 
the choice of maximum redshift used when drawing samples 
from the 2dFGRS catalogue. 

Another relationship between A and Zmax arises from 
considering the number of resolution elements in the 



smoothed sample. A high number of resolution elements is 
desirable in order to recover a statistically significant sig- 
nal, though there is no critical threshold. The number of 
resolution elements is 

iV.cs = 4r^ = ^^^A= ( (3) 



^3/2^3 



7r3/2Af,c 



A value of N^cs = 10 has been selected as indicative of a 
measurement of reasonable significance. 

As has been stated, the choice of Zmax is also restricted 
on account of the survey selection function; Figure 2 of 



James ( 2005 I shows a break in the slope of the cumulative 

max ^0.2, lead- 



number density as a function of redshift at Zn 
ing to a natural choice for cutting off the maximum redshift 
at which an object should be included in this measurement. 
This defines a relation within the Zmax-A plane, albeit one 
that is independent of A. This plane, along with the the 
constraints these three relations provide, is shown in Fig. [l] 
Point samples are drawn from the NGP catalogue out 
to those values of Zmax that will allow smoothing on scales 
of A = 5 to 8/i"^Mpc in l/i-^Mpc increments. The NGP 
catalogue has higher number density at low redshift, as it 
probes a smaller volume; the measurements in samples from 
the SGP catalogue will be correspondingly stronger and are 
also made above 5/i~^Mpc. Table [l] shows the salient pa- 
rameters of each of the point samples drawn from the NGP 
and SGP slices. 



2.3 Sample construction 

The flux-limited galaxy samples from the NGP and SGP 
catalogues include all objects out to a maximum redshift 
given the constraints above. Initial input from the 2dFGRS 
catalogue takes the form of a list of redshift, right-ascension 
and declination values, as well as a weight for each galaxy 
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NGP SGP 




max max 



Figure 1. Constraint diagrams for ttie point samples drawn from the two 2dFGRS catalogues. Considerations of the selection function 
(dashed green), spatial number density (solid blue) and resolution (dashed red) restrict values of 2max and A to the central triangular 
region. The constraint on the number of resolution elements is only indicative {Nres = 10^); it is desirable to draw samples as close the 
solid line as possible, so as to maximise the signal-to-noise of the measurement while avoiding spurious signal from sparse points being 
treated as isolated clusters. The points indicate the choice of A and Zmax employed in the samples used in this work. 



corresponding to the inverse of the completeness. The red- 
shift is converted to comoving distance {e.g. |Hogg|1999[ ) by 
numerically integrating 



Ho Jo 



dz' 



fi„ (1 zf + (1 - 0,„ - Oa) (1 + zf + ^^ 



;(4) 



the distance and angular coordinates are converted to Carte- 
sian coordinates, which are then rotated through the y-axis 
(corresponding to declination) so that the volume of the 
bounding box is minimised. 

For the purpose of the smoothing and the genus mea- 
surement, the galaxies are represented within a three- 
dimensional array, binned using a nearest-grid-point algo- 
rithm, with resolution set to be well below the smoothing 
length; using bins of size 2/i~^Mpc is deemed adequate for 
smoothing of 5/i~^Mpc or more. Figures [landU] show the 
NGP and SGP slices embedded in this data structure. Cells 
outside the survey region must be flagged as such, rather 
than empty; they are given negative values. 

However, by discretising the spatial set of galaxy loca- 
tions, those regions inside the slice removed during masking 
must be also be flagged. If this is not done, these cells will 
act as false voids, leading to a Swiss-cheese signature in the 
genus measurement. Masking for the survey regions based on 
the composite survey completeness maps is applied by flag- 
ging regions of the a-5 plane within and around the NGP 
and SGP slices that are not part of the survey. The flagged 
regions are projected along the line-of-sight to define a pen- 
cil beam of flagged cells within the data array. The incursion 
of these cut-outs around the edge of the slices (and the SGP 
slice in particular) is apparent in Figures |2]and[3] 



The correctly masked data array is a three-dimensional 
structure with a flagged (negative) value in cells outside 
the survey bounds, as well as in masked regions within the 
bounds. The remaining cells have positive value and repre- 
sent the weighted galaxy sample. 



2.4 Sample Smoothing 

To retrieve an estimate of the cosmological density fleld, 
the galaxy sample is smoothed through a Gaussian window, 
though spherical top-hat ( Hamilton et al.|1986 1 and wavelet 
techniques ( Martfnez et al.|2005 I have also been considered 
for the purposes of the genus measurement. The width of 
the smoothing kernel acts to select a scale, in Mpc, of the 
features of the fleld to be emphasised. As a result, by us- 
ing kernels of a range of widths, the measurement is made 
across a range of scales. The appropriate range is governed 
by the statistical constraints of the sample, developed in Sec- 
tion [2T2] and by the aim of studying the non- linear regime. 
The transition to scales where the evolution of structure be- 
comes non-linear, where modes of different scales become 
correlated, occurs at a characteristic length where the RMS 
amplitude of density perturbations is of the same order as 
the background, i.e. where 5p/ p ~ 1. Measurements of the 
variance of this fractional RMS amplitude between spheres 
of radius ?>h~^ Mpc, erg, have converged about 0.9 (Tytler 
et al.|2004 l, though disagreement persists {e.g. Spergel et al. 
20061). Here, the same value as other clustering analyses with 



the 2dFGRS data set is used: erg = 0.89. To study the weakly 
non-linear regime, therefore, is it necessary to probe around 
8fe~ ^Mp c; based on the sampling constraints outlined in Sec- 
tion 2.2 kernel widths of 5, 6, 7 and 8 ^~^Mpc have been 
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Figure 2. Surfaces of constant density through the cosmological density field of the NGP strip, extended out to a redshift of 0.2. The field 
is recreated by smoothing the statistically-complete point catalogue with a Gaussian filter, in this case of width 4 Mpc, and subsequently 
embedding it within the data array, as demonstrated in the lower panel. The images in the upper panel show the changing topology of 
the surfaces as different volume are excised: shown are surfaces containing (from top left) 33%, 66% and 100% of the volume within the 
sample. The single lower panel excises 50% of the field — corresponding to the median density — showing the geometry of the NGP region 
within the 394 X 475 X 60 array along with the effect the survey mask. At the high-redshift end of the wedge, many isolated spherical 
surfaces are apparent, indicating that Zmax = 0.2 is too large given the smoothing length; c/. Figure[T] 



chosen for both regions. These are related to the spherical 
top-hat filter radius, which corresponds more precisely to 
the notion of physical distance, by Ag ~ At/\/5; this result 
follows from the conservation of the filter volumes and the 
distinction should be kept in mind when interpreting the 
results. 

The smoothing is carried out via Fourier-space convo- 
lutiorj^of the sample with a Gaussian kernel. 



G(x) = exp 



X • X 

2A2 



(5) 



as the survey regions do not completely, or even nearly, fill 
the data volume — the slice geometry occupies approximately 
one-third of a tightly-fitting rectangular prism — there will 
be power lost through the outer surfaces of the slice from 
galaxies smoothed partially into the flagged non-survey re- 
gion. This signal is not recoverable and creates a relative 
drop in power near the slice boundaries that will, if left un- 
treated, produce a volumetric over- abundance of low-density 
region, leading to a swiss-cheese signature. To avoid this, the 



1 We use the FFTW hbraries ( |Frigo fc Johnson|2005"l v3.1.2 from 
http://www.fftw.org) for the discrete Fourier transform. 



density must be artificially raised in a manner that does not 
disturb the topological measurement. 



Melott & Dominik ( 1993 1 have demonstrated a method 



that correct for this systematic near the boundaries: cre- 
ate a three-dimensional array mirroring the geometry of the 
sample, but filled with a constant density field (say of value 
p = 1) rather than sparsely-separated galaxies. Smoothing 
this constant data array mimics the power loss through the 
surfaces of the survey boundary; dividing the smoothed 2dF- 
GRS data array cell-wise by the smoothed constant data ar- 
ray will remove this effect. Furthermore, they demonstrate 
that the only systematic effect induced by this process is 
a slight increase in the amplitude of the measured genus 
curve, a correction proportional to the ratio of surface area 
to volume that is negligible in this context; moreover, there 
is no change to the symmetry properties of the curve that 
are studied in this work. A more important penalty is the 
decrease in signal — when A is even a moderate percentage of 
the survey size, the fraction of signal lost can be quite large; 
these values are the final column of Table [T] 
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Figure 3. Isosurfaces of the SGP strip, extending to a redshift of 0.2, smoothing with a Gaussian of width A = 8/i~^ Mpc. The upper 
panels again show the surfaces containing (from top left) 33%, 66% and 100% of the volume within the sample. Of note is the significantly 
smoother structures generated by the larger smoothing kernel; the topology is very close to that of a Gaussian random field. 



3 TOPOLOGICAL ANALYSIS 

The morphology of the galaxy distribution has long been 
known to convey information from all orders of the cor- 
relation function, albeit in a way that is yet to be com- 
pletely elucidated. Unlike geometric measures of structure, 
morphological statistics concentrate on properties such as 
'connectedness' and global curvature rather than orientation 
or size. The most famous of these, the Minkowski function- 
als (|Mecke et al.||1994l [Kerscher et al.||1996[ |Schmalzing~ 
Bucliert|1997[ ), give a concise description of the density field 
up to topological equivalence; for two-dimensional surfaces 
embedded in three-dimensional space, they correspond to 
the enclosed volume, area, integrated mean curvature and 
integrated Gaussian curvature of the surface. 

The latter is proportional to the number of holes 
through the surface, called the genus number; informally 

g = number of holes — number of isolated regions -1-1. (6) 

When a surface is constructed through a three-dimensional 
field on the basis of a physical quantity, the genus number 
measures how connected (when g > 0) or disjoint (g < 0) 
regions demarcated by this quantity are. The quantity of 
interest to cosmologists is mass density; for instance, if the 
surface is drawn at the critical density for spherical collapse, 



the genus number will measure how inter-linked such regions 
are relative to the background. 

The numerical calculation of the genus number is done 
indirectly by computing the integrated curvature for surfaces 
through an array containing the smoothed density field. The 
algorithm of [Weinberg et ah] ( |1987| ) marks regions as either 
above or below a critical density and defines the surface 
as the boundary between them. The relationship between 
the genus number and integrated Gaussian curvature (on a 
smooth manifold) is the Gauss-Bonnet theorem 



/ KdA + kds 
J s J as 



■ 47r(l ■ 



(7) 



where K is the Gaussian curvature and k is the curvature 
on the boundary of the surface; the second integral vanishes 
whenever the surface is closed. Because the computation of 
the genus number takes place in a polygonal data structure, 
the surface is composed of the fiat sides of array cells along 
with the vertices and edges that join them; it is the result 
of an earlier theorem of Descartes ( Adam fc Tanneryl|1996 
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Sasaki|2003 1 on the angle defecl[^ of polyhedra that is rele- 
vant, viz. 



4ty(1 - 



(8) 



where Ai is the angle made by each face of the cell that meets 
a vertex and the outer sum is evaluated over every vertex in 
the array. This is the quantity that must be evaluated for 
each cell vertex of the array containing the smoothed density 
field; the angle deficit matrices tabulated by [Weinberg et aT] 
1 1987 1 are now used to compute the genus number of the 



surface. 



3.1 Genus curve of embedded surfaces 

The measurement of genus number is made not just on one 
surface, but on a sequence of surfaces spanning the range 
of densities contained within the field. The characteristic 
density that defines each surface is chosen to excise a fraction 
of the volume of the field, parameterised by the parameter 



(9) 



where Erfc is the conjugate error function. In this way, v — 
corresponds to splitting the field into the higher- and lower- 
mass regions that each occupy 50% of the volume; roughly 
the highest-density 15% of the volume lies above the surface 
at = ±1, 2% for u — 2 and so on as for the usual Gaussian 
integral. 

Defining the value of the density that splits the field into 
the proper volume fraction is done by reshaping the three- 
dimensional density field into an ordered one-dimensional 
array so that, as each element of the array represents a 
standard unit of volume, moving p% of the way along the 
array will give the critical density corresponding to enclos- 
ing (100 — p)% of the volume. That is, pc is the nth element 
of the sorted arrays, for 



n = fioor [(1 - Vf)N] , 



(10) 



where A'' is the length of the ordered array. 

Excising the high- and low-density regions with this 
value and measuring the genus number on the intervening 
surface gives a curve as a function of the volume param- 
eter V, I.e. u vf ^ Pc ^ g. The canonical shape of 
this curve for a Gaussian random field (formulated below in 
Equation 11 1 is shown in Figure[4] when v is negative, only 
small disjoint regions of the field are excised, so the genus 
number at this value is negative; as u increases, more disjoint 
regions appear and the genus number becomes even lower; 
eventually these regions start to join and the genus num- 
ber of the surfaces increase, crossing at v = —1; the curve 
peaks at = (median density), where the field is maxi- 
mally connected; the curve then recedes symmetrically — for 



^ Cf. the more modern sounding 'angle deficit'; while the descrip- 
tion of this measurement as an application of the Gauss-Bonnet 
theorem — which applies specifically to smooth manifolds — is 
hardly inappropriate, the more general index theorem of |Atiya"h] 
& Singer 1 1963 I is also worthy of mention. 




Figure 4. The archetypal genus of the Gaussian random field 
(solid line) normalised to the peak value a.t v = Q and (demar- 
cated by dashed lines) the regions of distinct topological character 
corresponding to the presence of isolated voids and clusters, for 
the troughs, and for the connected surface at median density. In- 
terpretations of genus curve measurements focus on differences 
between the observed curve and that of an idealised model, such 
as the Gaussian case. Departures from the model are quantified 
using the ratio between model and measurement of the area under 
three regions of the curve (shaded); this is discussed in Section[3.3| 



the case of a Gaussian field, high- and low-density regions 
are connected in the same way. 

So, there are three domains throughout the curve: i) 
< 1, the 'central peak', where the genus curve is positive, 
corresponding to those fractions of the volume that are more 
connected than isolated; ii) 1 ^ ^ 2.5, the 'troughs', 
where the genus curve is negative; and iii) |!/| ^ 2.5, the 
'wings', where the value of the curve, though still negative, 
is trending asymptotically to zero. While the crossing point 
from positive to negative values of genus number serves to 
separate the central peak from the troughs, the point of de- 
marcation between the troughs and the wings is less clear 
and somewhat more arbitrary; the points of inflection at 
±2.33 are the obvious candidates for a 
rigourous division, but this is perhaps unnecessary. While 
such a division provides little direct insight, it has provided 
the implicit basis for statistical analysis of the genus curve 



since at least Park et al. (19921; the wings, as a region of 



especially low signal-to-noise, are often excluded from mea- 
surement, while the interplay between the central peak and 
the troughs guides physical interpretation. 



3.2 Modelling of the genus curve 

To properly determine the physical significance of genus 
measurement made on data from redshift surveys, models 
of the genus curve itself are required. So far, theoretical suc- 
cess has been restricted to the special case of the Gaussian 
random field — believed to accurately represent the statisti- 
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cal properties of the density field immediately following the 
end of cosmological inflation — and small departures from 
this case, such as would be expected under smooth gravita- 
tional evolution. 

Consequently, these two descriptions are appropriate in 
different regimes: on large scales, at least A — lQh~^ Mpc, 
the genus curve equation for a Gaussian field tests whether 
the observed density field is faithfully captured by models 
of the early Universe. The perturbative model applies where 
small departures from the Gaussian case are expected, i.e. 
on scales below, but not much below, the divide between 
the linear and non-linear regimes. It is these scales that are 
probed by the current work. 

3.2.1 Linear regime: Gaussian random field 

Much of the early success of the genus curve as a diagnostic 
tool for diverging models of structure formation can be at- 
tributed to the construction of an analytic function for the 
special case of a Gaussian random field. As calculated by 



a number of independent researchers ( Doroshkevich 1970 
Adler|1981| [Bardeen et~ar||1986[ [Hamilton et al.||1986 ), the 
formula is 



ffGRF('^) = ^exp (1 - v'^) 



(11) 



where the amplitude of the curve is determined by the power 



spectrum of the field ( Hamilton et al.|1986 1 

3/2 



A 



1 

47J-2 



3 



where 



J k^P{k)W'^{k\)d^k 



(12) 



(13) 



j P{k)W^{k\)d^k ' 

When the window function is a Gaussian of width A — 
such as equation ([5| — and the power spectrum P(fc) a power 
law with spectral index n, the normalisation factor is 



A 



47r2A3 



3-f n 



3/2 



(14) 



The significance of this result is that the curve is necessarily 
symmetric for general choices of power spectrum; the result 
is applicable even for anisotropic fields; |Matsubara fc Suto| 
( 1996 1 have shown that the impact of redshift-space distor- 



tions in the linear and weakly non-linear regimes is small. 
The high- and low-density regions of the field are treated 
equally with respect to the volumes they excise and to the 
connectedness of such regions. While the term 'Gaussian 
random field' refers to the case where the Fourier ampli- 
tudes of the field are Rayleigh distributed and the phase 
distribution is uniform in the interval [0, 2n), it is the latter 
condition that is most salient when probing large volumes 
of the Universe; even fields with power distributed in a de- 
terministic fashion {e.g. P{k) oc k") axe often referred to as 
'Gaussian' when they satisfy the random phase condition; 
this amounts to an implicit invocation of the central limit 
theorem. 



3.2.2 Weakly non-linear regime: perturbation theory 

Correlations between phases such as those induced by non- 
linear gravitational evolution lead to the disruption of sym- 





S3(xl0-6) 


S2(xl0-3) 


si(xlO-^) 


so 


5(0) 


-98.96 


4.625 


-88.42 


4.007 


5(1) 


-101.6 


4.750 


-92.63 


4.104 


5(2) 


-20.83 


0.9063 


-8.792 


3.681 


Ps 


-2.604 


0.1250 


-4.208 


0.09700 


Pi 


78.13 


-3.719 


79.63 


-0.3260 



Table 2. All three skewness parameters are estimated by the 
function S(X) = X]n=o ^^^^ table shows the coefficients s„ 

for each skewness parameter, as well as the Hermite coefficients 
of equation | |15[ l. 



metry about the median density contour of the genus curve. 
The nature of this evolution has been described in a series 
of papers ( Matsubara|1994 Matsubara fc Sut"ol|1996 1 using 
cosmological perturbation theory: in a Gaussian field 5, all 
connected moments higher than n — 2 vanish; in a non- 
Gaussian field satisfying a technical requirement in which 
higher moments are of order a specific power of the rms 
fiuctuations of the field a, the genus curve is given by 



where H 



i -Aexp {-i''^/2) X 
a{P;H3{u) + PiHiiu))], 



(15) 



(u) = (-l)"exp(!/V2) (rf/di/)"exp(-;/V2) are 
the Hermite polynomials, their coefficients P„ are deter- 
mined from the skewness parameters S*-"-* of the smoothed 
field (defined as per equation (2.8) of Matsubara & Suto 



|1996[ ); the approximation to the genus formula in equa- 
tion ( 15 1 arises only from discarding terms of order and 
above; and the normalisation A is unchanged from the lin- 
ear regime. It is this equation that can be used to predict 
the genus curve on scales below the linear regime and it has 
been used in the analysis of the results. 

The genus formula for the Gaussian random field is ex- 
pressible in this formalism as well, viz. 



5GRF = -^exp {-iy^/2) H2[v), 



(16) 



where now, with only an even Hermite polynomial present, 
the genus curve is symmetric about v = — notably, this will 
not be the case in the weakly non-linear regime. The value 
of the skewness parameters, as well as the rms fiuctuation 
in amplitude of the field, change as a function of scale. The 
Hermite coefficients are given simply by 



AO) 



Pi 



-(2) 



-(0). 



(17) 



over the range of scales probed in this work, the values 
of the skewness parameters — and hence those of the Her- 
mite coefficients — can be fit by a cubic polynomial. Table [2] 
shows the polynomial coefficients of the parameters, while 
the value of a can be left as a free parameter. 

3.3 Physical meta-statistics 



Both of the models in Section [3.2| can be used as analytic fits 
to genus measurements from the 2dFGRS data. As the range 
of scales considered in this work pass well into the non-linear 
regime, it is to be expected that the genus measurement will 



deviate substantially from equation (111 



A measure of such departures was provided by |Park| 



et al. (19921, who define the genus shift parameter 
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/-i5model(i';^)'^^' 



(18) 



when the model accurately reproduces the genus measure- 
ment, Ai/ = 0, but when a meatball- or swiss-cheese-shift 
are recorded, /S.u becomes greater or less than unity, re- 
spectively. The scale-dependence of the parameter is sig- 
nificant because the non-linear effects induced by gravita- 
tional evolution cause a modest meatball-shift in the genus 
curve ( [Park fc Gott|p9T| . 

To determine which regions of the genus curve account 
for such deviations — and thus provide motivation for inter- 
pretation of the results in terms of voids and clusters — 
further meta-statistics have been defined by |Park et al.| 
(20011) andlPark et al.l (120051), 



^2.2''5model('^; 

1-2.2 
.2 



and 



(19) 



(20) 



/i.2''flmodel(^;^)'^^ 
that probe the deviation from the model in the regions about 
the troughs of the genus curve below and above = re- 
spectively, in contrast to Ai/, which is localised about the 
central peak. 



4 RESULTS 

We measure the genus statistic at 100 points in the range 
—4 ^ ^ 4 for the NGP and SGP slices smoothed on scale 
ranging from 5 to 8 /i~^Mpc. The errors on the measure- 
ments are estimated using the technique of bootstrap resam- 
pling ( |Efron|[l982[ [Barrow et al.|[r984[ ) , in which we create 
distinct samples with the same variance properties as the 
data by redrawing — with replacement — individual galaxies 
from the input 2dFGRS catalogue up to the full number of 
objects; the use of jackknife resampling has not yet been 
properly explored in the context of the genus measurement. 
After creating 100 such resampled catalogues, each one is 
subjected to the method described in Section [2] 

To test the robustness of the bootstrap resampling, we 
simulate a Gaussian random density field, with the same 
geometry as the survey data, from which 100 Poisson sam- 
plings are drawn. These are smoothed as per the data and 
the genus of the resulting distributions measured. In Fig- 
ure [5] the variance of these is compared to the variance of 
the genus measurements from the same number of boot- 
straps of a single Poisson sample. It is found that, although 
the bootstrap resamples display on average a meatball-shift, 
the variance between them is unbiased — on average, the er- 
ror bars estimated from the bootstrap are 6% smaller than 
those from the Poisson samples, but do not exhibit any 
large-scale (/-dependent trends. Though there is a consid- 
erable (~10%) tendency for the bootstrap to both over- and 
under-estimate the error bars in the regions to which A^, 
Av and Ac are sensitive, this scatter is not systematic; we 
conclude that the use of bootstrap resampling along with 
this smoothing method will not create anomolous values for 
the meta-statistics by virtue of distorting the error bars on 
the genus measurement. 

From the resulting set of mock genus measurements. 
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Figure 5. Fractional difference between the bootstrap and Pois- 
son estimates of errors for the genus of a simulated Gaussian ran- 
dom field subject to the same geometric constraints as the data 
in this work. 



the covariance matrix of the genus curve is computed for 
both slices and at each smoothing scale. Recent work ( [Hart-I 



lap et al. 2007) has demonstrated that the bootstrap is a 



biased estimator of the inverse covariance matrix: the loca- 
tion of the peak in the likelihood function is unchanged — so 
the best-fit model is correct irrespective — but the slope of 
the likelihood function in the region about the peak will 
be overestimated, making the fit appear better than it is; a 
scaling of the inverse correlation matrix , in accordance with 
Equation (17) of |Hartlap et al.| ( |2007[ ), produces an unbiased 
estimator. 

This provides a way to perform a analysis of fits 
to the measurement. For the two models discussed in Sec- 
3.2 (Equations (111 and (15l), we fit the genus mea- 



tion 



surement by minimising the resulting dispersion 



X 



n — p 



[g{iy) - gmodci{i^)f C ^ [g{u) - gmodci{iy)] , (21) 



where C~ is the re-scaled covariance matrix, n = 100 is 
the number of data points and p is the number of model 
parameters and is either one or two. 

In this section, we compare the 2dFGRS observations to 
the models and examine trends with scale, as well as differ- 
ences arising from the choice of model and meta-statistics; 
these results are related back to the distributions of voids 
and clusters. 



4.1 2dFGRS topology below the linear regime 

The genus measurements for the NGP and SGP slices are 
shown in Figure [6] along with the best-fitting Gaussian ran- 
dom field and weakly-perturbed model genus curves. When 
calculating statistics such as Au the model fitting is done by 
minimising the scatter in the region —3 3, rather than 

the traditional — 1 ^ ^ 1, in order to use information from 
all parts of the genus curve. The values of the meta-statistics 
described in Section [3. 3| for the NGP and SGP slices, taken 
relative to the best-fit genus curve of a Gaussian random 
field (eq. Ill, are given in Table |3] 

These measurements show that for both the NGP and 
SGP samples, Av deviates below zero, indicating that the 
samples are cluster-dominated at some scales (a meatball- 
shift topology); we can test the significance of this deviation 
using the statistic 



2 



S/h 

4 ^ 

\ = 5/h 



AKA)~A/.GRF y 

O-Ai'(A) / 



(22) 
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Figure 6. Genus curves for samples drawn from the NGP (left) and SGP slices of the 2dFGRS over scales from 5 to 8 h~^Mpc, with 
3(T uncertainty region estimated from the variance between bootstrap resamples of the data sets. The two fits to the measurements are 
made with the models of Equations (smooth solid line), leaving the amplitude of the curve as a free parameter, and | |15| l (dashed 
line), with both the amplitude and density rms cr varying; in several cases the two curves are nearly identical. The goodness-of-fit values 
quoted are the reduced i accounting for the additional parameter in our fit of the weakly non-linear model. 



for each of the samples, which measures the deviation from 



the Gaussian value ^v. 



GRF 



0, and which is tabulated 



within Table |4] This shows that, assuming the uncertainties 
are normally distributed, the topology of the SGP region 
is unlikely to be consistent with a Gaussian random field 
and that the NGP is definitely not to a high level of signif- 
icance. This is in agreement with the results of simulations 



presented by Park et al. ( 2005 1 for the case of biased peaks 



of the ACDM density field, and inconsistent with other cases 
including unbiased dark matter and haloes. 



The measurements also indicate that both Ac and 
are below unity, showing a lower multiplicity of clusters and 
voids relative to the case of the Gaussian random field. Ta- 
ble [4] again quantifies the departure from the Gaussian case, 
using the analogous statistics to equation ( |22[), where t he de- 
viation is now from unity rather than zero. Park et al. ( 2005 1 



have demonstrated how the value of A^ might descend be- 
low unity through (linearly) biased galaxy formation. It 
is unclear what general mechanism can drive A^ down as 
well, though a physical argument is that when the average 
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NGP 


A 


Au 


A, 


Ay 


5 


0.0063 ±0.0277 


0.9957 ± 0.1113 


0.8534 ± 0.0840 


6 


0.0158 ±0.0167 


0.8056 ± 0.0464 


0.7400 ± 0.0388 


7 


-0.0399 ±0.0166 


0.8997 ± 0.0508 


0.7378 ±0.0433 


8 


-0.0598 ±0.0150 


0.7693 ± 0.0438 


0.6348 ±0.0336 


SGP 


5 


0.0070 ± 0.0230 


0.6741 ± 0.0576 


0.8143 ±0.0670 


6 


-0.0249 ±0.0129 


0.7811 ± 0.0337 


0.7569 ± 0.0384 


7 


-0.0060 ± 0.0143 


0.8850 ± 0.0360 


0.8054 ± 0.0361 


8 


-0.0295 ±0.0148 


0.9926 ±0.0412 


0.8040 ± 0.0367 



Table 3. Asymmetry meta-statistics for the 2dFGRS samples 
measured relative to the best-fit genus curve for a Gaussian ran- 
dom field as per Equations \18\ , \19\ and | |20| l. la confidence 
limits are the standard deviation of the set of mock asymmetry 
statistics generated using the resampled genus measurements, rel- 
ative to the same model curve. 





Against the hypothesis of 


Against the hypothesis 




a Gaussian random field 


of no evolution 




2 2 2 

^Al^ ^Ay 




NGP 


5.66 12.3 50.74 


3.35 1.57 2.21 


SGP 


2.00 21.13 26.33 


0.69 6.60 0.29 



Table 4. One-sample (reduced) tests on the statistics derived 
from the genus curve measurements, testing in the first place the 
case that the density field at these scales is a Gaussian random 
field (the parameters have their Gaussian values {Au, Ac, Ay} = 
{0, 1, 1}), and in the second place that the topology of the density 
field is not evolving with scale (the parameters have their constant 
best-fit values). 



non-evolving values of the parameters. Though the two sam- 
ples do not agree on trends in Ac and Ay , it is clear that the 
topology is evolving with scale to a high statistical signifi- 
cance. The most remarkable trend is in the SGP slice, which 
displays a cluster multiplicity consistent with a Gaussian 
random field on the largest scales and decreasing smoothly 
to well-below-Gaussian levels at smaller scales. The source of 
this trend is clearly discernible in Figure|6]as a relative eleva- 
tion in the genus curves over the region in which the cluster 
multiplicity integral is calculated. Moreover, this elevation 
broadens and shifts to larger values of v on smaller scales, 
indicating an increasing degree of connection of high-density 
clusters, which i) occupy an increasingly extreme fraction of 
the sample volume and ii) extend across a widening range 
of densities. To complement this, the void multiplicity is es- 
sentially stable at sub-Gaussian field levels. 

The NGP slice also displays evolution, though the 
trends are less clear: while the void multiplicity appears to 
increase toward smaller scales, this could be a result of sup- 
pression in the number of isolated voids at larger scales due 
to large, connected over-dense structures within the NGP 
field. It is possible to test for the two samples describing the 
same evolution in each parameter using the statistic 

2 _ 1 ( ^'^GPW-^'^SGP(^) V .,o^ 

Xcross - 7 / f\\2 I ffvi ' 

4 ^t^^,^ \ V'^NGP W + '^SGP W J 

where each a is the error bar on the Au parameter and 
where analogous statistics can be defined for Ac and A^. 
The NGP and SGP samples do not show consistent trends 
and offsets in the parameter values, with the cross-statistic 
yielding 2.05 {Au), 5.16 {Ac) and 3.30 (Ay). On the scales 
probed in this work, it appears that the topology of structure 
differs modestly between the two regions at median density 
and more strongly for over- and under-dense structures. 



nearest-neighbour separation of the sites of galaxy forma- 
tion is shorter than the Gaussian case, at a fixed smoothing 
scale there will less isolated high-density peaks in the field. 
This is accompanied by greater connectivity of void regions, 
resulting in a decline in Ay ; whether this need always be the 
case is not clear. 



4.2 Scale-dependent evolution 

There are trends in each of Au, Ac and Ay as a function 
of scale shown in Figure [7| The trend in the value of Ai> 
over the scales examined from negative to zero (within the 
uncertainty) , is in partial agreement with the measurements 
on simulations by Park et al. ( 2005 I who detect a trend from 



positive to negative as A increases, with a crossing point at 
6h~^ Mpc. We observe no evidence of such a crossing in the 



SGP slice, though Matsubara & Suto ( 1996 1 show that the 



effect of velocity-space distortions on the small-scale genus 
curve suppresses the value of Av in the weakly non-linear 
regime. The value of Af in the NGP slice more accurately 
matches the predictions of both theory and simulation, mov- 
ing from below zero to zero in the range between 7 and 6h~^ 
Mpc. 

Table [Z] also gives the reduced for ths best-fitting 



4.3 Effect of model choice 

A comparison of goodness-of-fit between the Gaussian and 
weakly non-linear models for the genus curve does not pro- 
vide conclusive evidence that the weakly non-linear curve is 
superior. In most cases, the difference between the models 
is negligible, though when it is not, the weakly non-linear 
curve is much to be preferred. However, the strongest poten- 
tial confirmation of the weakly non-linear theory — that the 
quality of the fit of this model improves relative to the Gaus- 
sian one on progressively smaller scales — cannot be demon- 
strated based on these results. This reinforces the claim that 
the effect on the genus curve of accounting for weak depar- 
tures from Gaussianity is modest, though one must expect 
this will not hold as such departures grow in amplitude. 

The results could be taken to indicate that this effect 
will be best seen on scales smaller than those that have been 
probed and requires somewhat larger volumes than are avail- 
able on these scales with the present data. An interesting 
alternative is to question the impact of differing forms of 
galaxy bias on the genus curve. The 2dFGRS has been used 
(with statistics other than the genus curve) to study the lin- 
ear and quadratic bias of galaxies ( Verde et al.|2002" I as well 
as the stochasticity of the relative bias between early- and 
late- type gala:xies ( [Wild et al.||2005] ); topological statistics 
may be sensitive to a third characterisation of bias as non- 
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Figure 7. Genus curve meta-statistics At', and A„ as a function of scale for the 2dFGRS NGP and SGP survey slices, with la error 
bars determined by the variance between the same quantities calculated on each of the bootstrap resamples. Each is calculated by taking 
the ratio between the data and a model the area under different regions of the genus curve {n.b. however a factor of v in the integrand of 
the numerator in Equation |18| l; results are shown for the calculation relative to the genus curve for a Gaussian random field (solid line) 
and for its weakly non-linear perturbation (dashed), with the null hypothesis Gaussian random field parameter values (dotted) shown 
for reference. 



local, as the smoothed distribution incorporates information 
from regions around the sites of individual galaxies. These 
will certainly lead to asymmetries in the genus curve of the 
kind observed here, though it is not clear whether these will 
involve different modifications to those incorporated in the 



4.4 Comparison with larger scales 



formalism of equation ( 15 1 



It is also possible to investigate the effect of using the 
weakly non-linear genus curve model when measuring the 
parameters Ai^, Ac and Av . The dashed lines in Figure [7] 
show the value of these parameters when measured relative 
to the weakly non-linear genus curve with skewness param- 
eters selected as a function of scale. No distinction is to be 
made between the models in the SGP slice, while the val- 
ues of the parameters in the NGP slice differ at those scales 
where the weakly non-linear model provides a substantially 
better fit. 



James et al. ( 2007 1 have studied the topology of large-scale 
structure in the 2dFGRS on scales ranging from 8 to 14 
/i~^Mpc, using slightly different methodology to that em- 
ployed here with regard to the construction of statistically 
complete samples, as well as in the parameterisation of 
asymmetries in the genus curve. Figure [8] extends the re- 
sults in the present work to larger scales by calculating Ai^, 
Ac and A^ for the earlier results. 

The two sets of results are consistent at the overlapping 
scale of 8 /i~^Mpc, particularly considering the methodolog- 
ical differences. The cluster parameter in the SGP is a pos- 
sible exception to this, and indeed it seems clear that little 
inference about the scale-dependence of this statistic across 
the full range is possible given the data presented here. The 
trends that do emerge are: i) the shift parameter is consis- 
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Figure 8. Comparison of meta-statistics Au, Ac and Ay for the 2dFGRS NGP and SGP survey slices between the scales studied in this 
paper (solid lines; identical to Figure[7]l and those of [James et al.|l|2007| (dashed lines). 



tent with zero at larger scales, drops below zero at around 
10 /i~^Mpc and returns to at the smallest scales studied in 
this work; ii) the void parameter changes remarkably little 
when moving in to the non-linear regime and is consistent 
with Av ~ 0.8 across all scales and between both regions of 
the survey. This value for the void parameter is also seen 
in the topology of the Sloan sub-regions reported by |Gott| 
et al.| ( 2008 1 , as well as their measurements on the Millen- 



nium Run ( Springel et al.pOOS i and on simulations of dark 



matter haloes populated with galaxies (Park et al. 2005 



Kim fc Park||2006 l, to the point where it seems to deserve 



recognition as a generic feature of genus measurements. 

Further work in this area should aim to characterise 
the scale at which the shift parameter departs from zero 
and, if the turn-around at lower scales is reproduced in in- 
dependent samples, the scale at which the meatball-shift is 
maximised. Though the void parameter is now routinely ob- 
served to depart from unity, no complete explanation for 
this observation has emerged. The understanding of both 
of these trends would profit from examination of the differ- 
ences between particular galaxy populations. The continued 



progress of the SDSS and the early results reported by ( Park 
^raL][2005| ) auger well for this effort. Forthcoming surveys 
based on photometric redshifts are likely to make a substan- 
tial contribution through the two-dimensional genus statistic 
in redshift slices. 

It is interesting to speculate on the correspondence be- 
tween the angular- and three-dimensional correlation func- 
tions and the two- and three-dimensional genus statistics, as 
well as between the ratioing process of |Melott fc Dominik| 
( 1993 1 and the common practice of random point catalogues 



to account for geometric and astronomical selection effects in 
the calculation of correlation functions. Progress on the in- 
terpretation of the genus measurements may also require the 
use of alternative meta-statistics. A possible way forward is 
to use the orthogonality relations of the Hermite polynomi- 
als and decompose the genus curve into a series of Hermite 
functions ipniy) oc exp(— !/'^/2)iii'„(i/). The coefficients of the 
decomposition are linked to the skewness parameters of the 
field — in the linear regime only the n = 2 coefficient is non- 
zero, while in the non-linear regime the n = 1 and n = 3 
functions contribute with some small amplitude to the genus 
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curve, inducing a slight asymmetry. A full decomposition of 
the curve on scales at and below those studied in this work 
would guide theoretical interpretation and prediction of the 
topology of non-linear structure. 



5 CONCLUSIONS 

We have devised a method for measuring the genus curve 
on flux-limited samples from galaxy redshift surveys in the 
non-linear regime, setting out the requirements for avoiding 
major systematic effects in the form of a joint constraint 
diagram. This method has been applied to the 2dF Galaxy 
Redshift Survey, providing a smoothed galaxy distribution 
on which the topology of large-scale structure in the weakly 
non-linear regime, from 5 to 8 /i~^Mpc, has been studied. 
These measurements are the shortest-scale study of struc- 
ture topology carried out to date with observational data 
and have been aimed explicitly at detecting the effects of 
non-linear structure formation on the genus statistic. 

By measuring the genus shift, void multiplicity and clus- 
ter multiplicity statistics of the genus curve, we have aimed 
to quantify departures from the linear regime case of a Gaus- 
sian random field. We have observed the effects of structure 
formation on the genus curve in the form of non-linear gravi- 
tational evolution, as demonstrated by a trend in genus shift, 
and demonstrated that both the cluster and void multiplic- 
ity statistics tend to fall below unity across a broad range 
of scales. The independence with scale of the latter quan- 
tity in particular suggests effects external to the non-linear 
evolution of the dark matter field, in agreement with ear- 
lier suggestions about a relationship between Av and galaxy 
bias. 

We have studied the effect of model choice in the mea- 
surement of these statistics, using analytic predictions for 
both the linear regime and weakly non-linear regime as 
comparators to the data. We find that neither model ade- 
quately captures the shape of the genus curve, though there 
is evidence at some scales that the weakly non-linear model 
provides a substantially better fit. Together, these measure- 
ments demonstrate the efficacy of studying topology in the 
non-linear regime and, given the emphasis future surveys 
are placing on higher-order statistics, suggest that topology 
will play an important role in answering questions about the 
formation and evolution of galaxy clusters and dark matter 
haloes on these scales. 
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